#AI Datasets
Explore tagged Tumblr posts
onepawproductions · 2 years ago
Text
Taylor Hebert v3
Tumblr media Tumblr media Tumblr media Tumblr media Tumblr media Tumblr media Tumblr media
Taylor in the Trainyard at Night, liminal spaces
Expand for Ai Error, my little rant, and more!
Tumblr media
So, fun fact: I've been hard at work creating a sharable embedding for Taylor Hebert (what I picture her looking like, which is a mix of Claudia Black, and about three different awesome women I've known throughout my life)
After 10 hours of failed attempt, I finally got it dialed in, and it was producing fantastic pics of her face in various emotions and poses
Tumblr media Tumblr media
So I thought... Awesome! Let's try it in txt2img!!
Aaaaand, then cue this straight -up Hot Garbage:
Tumblr media Tumblr media Tumblr media Tumblr media
What. The. Heck.
Went back to SDXL (which cannot train embeddings), and got good images again, but not a reproducible face, body, and expressions.
Okays, so back to the training on SD1. 5:
Tumblr media Tumblr media Tumblr media
And TayTay is cute again! Nerdy, pale, a cloud of dark hair. Perfect for Pre-Powers Taylor!
So why the hot mess?
I'll let Taylor express my feelings on the matter:
Tumblr media
AAAAAARGH.
But it is now time for all good computery programmery artists to go to bed. Dang. So close!
Tomorrow is back to Chapter Art for The Muddy Princess. Chapter 17 is finished, and just needs it's art accompaniment for the Teaser posting!
Meanwhile: enjoy the Taylor montage!:
Tumblr media Tumblr media Tumblr media Tumblr media Tumblr media Tumblr media Tumblr media Tumblr media Tumblr media Tumblr media
4 notes · View notes
Text
0 notes
gts-ai · 11 months ago
Text
Tumblr media
Harness the power of AI and machine learning to unlock the full potential of video transcription. Our advanced video transcription services deliver unparalleled accuracy and efficiency, transforming spoken words into text with precision. This innovative solution enhances the performance of your machine learning models, providing high-quality data for better insights and decision-making. Ideal for industries such as media, education, healthcare, and legal, our AI-driven transcription services streamline data processing, saving time and reducing costs. Experience the future of video transcription and elevate your data-driven projects with our cutting-edge technology.
0 notes
victusinveritas · 25 days ago
Text
Tumblr media
Art by Toothy Bj.
15 notes · View notes
javert · 2 months ago
Text
i want to be in your dataset. put me in your dataset. let me innnnnn
13 notes · View notes
phlebaswrites · 2 months ago
Note
Will filing a DCMA takedown mean that the jackass behind the theft will see my legal name and contact info?
I'm not a lawyer so I can't say for sure, but I think it's likely.
For starters, the takedown notice will go to the company so they'll definitely see your details.
nyuuzyou (the person claiming ownership of the dataset into which they've processed all our unlocked works on AO3) has already clearly indicated that they believe they're in the right, and they're willing to fight against the takedown notices - they filed a counter notice to say as much right after OTW filed the first takedown notice with huggingface (the website to which nyuuzyou uploaded the dataset).
They also tried to upload the dataset to two other websites (it's thankfully now been removed).
Given that, it's possible (though I can't judge how likely) that these takedown notices might end up in a court of law somewhere, and in such a case nyuuzyou will definitely have access to them - and all of our IRL names.
This is one of the hazards of DMCA takedown notices, leaving fanwork creators to choose between protecting our creations or connecting our IRL and fannish identities at the risk of doxing. It is also why I've been careful not to say that we must all file takedown notices, in fact I think that anyone who is in a vulnerable situation most emphatically MUST NOT.
Let me be clear.
DO NOT DO THIS IF IT WILL HURT YOU.
Instead, leave that up to fans like myself who have less to lose and are willing to take that risk.
Right now, what we are doing is engaging in both a legal fight but also something of a public awareness campaign.
The huggingface site that is currently hosting this dataset is actually one facet of Hugging Face, Inc. a well known French-American company based in New York City that works in the machine learning space. I can't imagine that they want to be known as bad faith actors who host databases full of stolen material. They are a private company right now, but if their founders ever want to go public (and make a lot of money selling their shares) they would prefer not to be the subject of bad press. I make a note that they might already be preparing for an IPO since their stocks seem to be available for purchase on the NASDAQ private market and they raised $235 million in their series D funding round. This is a company that is potentially valued at $4.5 billion - they have bigger fish to fry than a bunch of members of the public conducting the legal equivalent of a DDoS on them.
Because that's effectively what we're doing - we are snowing them under with takedown notices that have to be individually replied to and dealt with. We are trying to convince huggingface that deleting the dataset nyuuzyou uploaded is the easier and less problematic option than legally defending nyuuzyou's right to post it.
The other thing that we're doing is making a public anti-AI stand.
We are telling the LLM / Gen AI community that AO3 is not the soft target it might look like - they might be able to crawl the site against site rules and community standards but if they post their datasets publicly for street cred (and that's exactly what nyuuzyou is doing) then we will act to protect ourselves.
The status of fanwork as a legally valid creative pursuit - to be protected and cherished like any other - is a long campaign, and one that the OTW was founded on. When @astolat first proposed AO3, it was the next step in a fight that had been ongoing for years.
I'd been a fan for over a decade before AO3 was founded and I personally don't intend to see it fall to this new wave of assaults.
Though it is interesting to be on this end of a takedown notice for once in my life! 🤣
11 notes · View notes
quietwingsinthesky · 10 months ago
Note
how much of your writing is ai generated
ngl anon kind of fucking rude to come here and accuse me of that. i don’t make fic just to rack up some arbitrary numbers, be that wordcount or idk, kudos. i make fic because i fucking care about what i’m writing about. if i didn’t, i wouldn’t write it, i certainly wouldn’t post it. AI fic is a plague on fandom for plagiarism reasons, obviously, but also because why should anyone give a shit about your writing if you didn’t? I don’t care if one day we have AI that makes stuff identical in quality to what people can, or better, even, because the words on the page aren’t the point, it’s always about the reason behind putting them down.
So, to answer your question, none of it. And it never will be. I’d rather never write again than stoop to that. And I certainly think less of anyone who does it themselves.
24 notes · View notes
marianarira · 1 year ago
Text
Tumblr media Tumblr media
I tried nightshade and glaze with this painting from 2019!
Protect your images from genAI with Glaze! Paintings, photos, 3D renders... everything! Tell your friends!
32 notes · View notes
chronomally · 3 months ago
Text
Death by a thousand cuts in my class this week
6 notes · View notes
torchickentacos · 7 months ago
Text
I'm getting so sick and tired of AI. Someone needs to start mass-feeding it eng dub AG pokemon scripts only or something. Make it watch Spontaneous Combusken 87934573489 times. I think that would sufficiently poison its datasets if we could do it enough times.
18 notes · View notes
ao3scrapesearch · 1 month ago
Note
Thanks a lot for making this tool, it helps a lot to know exactly which fics were scraped, even if there isn't anything to be done for now. I do want to ask how you managed to grab the data on which fics were scraped? Was it downloaded before the takedown? (This is a purely curious question feel free to ignore)
You're welcome!
It has a whole 11 notes, so I don't think many people SAW it, but I went through how I made the tool right here.
tl;dr: it's based on the metadata-only set someone else linked to on the Hugging Face comments, but I did download the full original dataset as well to confirm with a select few fics.
4 notes · View notes
thesuncantreachyouhere · 9 days ago
Note
who/what is the song in your pinned from? i can't stop thinking about it...
i wrote the lyrics and then put them into suno, here is a couple more you might enjoy
2 notes · View notes
axesent · 11 months ago
Text
Tumblr media
Just a heads up to any non AI artists that use red bubble (among many more). They are allowing your work to be used by the LAION-5B data set for use in AI training. haveibeentrained.com is free to use
9 notes · View notes
aromanticduck · 1 year ago
Text
To be honest I would actually love to see what manner of fucked up art a computer can produce, but the people in charge of the computers insist on pushing them towards flavourless facsimiles of human-made art so companies can use them to cut costs.
12 notes · View notes
aitan · 5 months ago
Text
Tumblr media
Dalla pagina Instagram di dailychatgpt
5 notes · View notes
briz-z · 2 months ago
Text
my thoughts of the recent case of ai scraping of AO3 is that at least one (1) person within the thousands of fandoms and works that were scraped is rich enough and crazy enough to sue the user who did it. because justice.
2 notes · View notes